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J. INTRODUCTION 


A. BACKGROUND 

One of the most difficult problems 1n tropical meteorology is forecasting tropical 
storm intensity. Numerous models for the prediction of tropical cyclone motion are in 
operational use at various tropical cyclone centers (Jarvinen and Neumann, 1979; 
U.S. Command Center/Joint Typhoon Warning Center, 1985). In contrast, there are 
very few aids for forecasting tropical storm intensity changes in operational use today. 
Jarvinen and Neumann (1979) suggest this disparity is due to the difficulty in 
establishing cause and effect relationships for intensity changes. George and Gray 
(1976) have documented the motion response of the tropical cyclones to environmental 
“steering” and significant predictor/predictand correlations have been established. 
Similar well-marked correlations have not been established in the case of intensity 
changes, at least not for the forecast period beyond 24 h. However, a renewed interest 
in intensity forecast techniques has recently developed as motion forecasts have 
improved. | 

Dvorak (1975) developed an empirical technique based on visual satellite imagery 
for estimating 24 h intensity changes. The technique was updated (Dvorak, 1982) to 
Incorporate enhanced infrared and digitized satellite imagery, which extended the 
procedures to nighttime as well as daytime applicability. Unfortunately, this technique 
1s plagued with several limitations and shortcomings. A 24h forecast is of marginal 
operational use in support of flight or maritime operations for which more than 24 h 
leadtime is needed to effectively respond to the threat of a tropical cyclone. In 
addition, this technique 1s somewhat subjective; a trained analyst must match current 
imagery to model storm patterns. Finally, the technique does not handle explosive 
intensification verv well. 

Statistical objective intensity forecast techniques based on conventional storm- 
related data (such as present intensity, latitude, longitude, etc.) were developed bv 
Elsberry et al. (1975) for the western North Pacific and Jarvinen and Neumann (1979) 
for the North Atlantic region. Both studies generated forecast regression equations for 
periods up to 72h, rather than the 24 h forecast period characteristic of the Dvorak 


technique. These techniques basically use a historical sample of storms to develop a 


climatology and persistence forecast of intensity similar to the widely used CLIPER 
track forecast techniques. The basic shortcoming noted in both studies is the 
characteristic failure of the equations to handle the abnormal case, that 1s, the rapidly 
intensifying or decaying storm. Elsberry ez al. claim that we must improve our ability 
to recognize the abnormal case if intensity forecasts are to improve. Jarvinen and 
Neumann suggest we must look beyond the storm-related factors (presumably to 
environmental influences) to increase our ability to forecast intensity changes. Merrill 
(1987), who studied tropical cyclone intensity changes in the North Atlantic basin, 
supports the hypothesis that environmental conditions influence intensity changes of 
tropical cyclones. However, he concludes the linear relationships are very weak and of 
little use as objective forecast aids. 

The purpose of this study is to demonstrate that empirical orthogonal function 
(EOF) representations of the zonal and meridional wind fields and of the vertical wind- 
shear fields can serve as effective predictors of future tropical storm intensity. Shaffer 
(1982) used an EOF analysis to represent 500 mb geopotential height fields on a grid 
centered on a tropical cyclone. Shaffer and Elsberry (1982) demonstrated that 
coefficients from EOF analysis could be used as synoptic forcing predictors in 
Statistical-synoptic track prediction schemes. In a similar study, Wilson (1984) used 
EOF analysis to represent the 700, 400 and 250 mb wind component fields on a refined 
grid centered on the cyclone. Wilson (1984) also showed that the coefficients from the 
wind EOF analysis could be used as synoptic forcing predictors in a statistical track 
prediction scheme. Schott (1985) used data stratified by past motion to show that the 
coefficients of the wind EOF analysis could be used as synoptic forcing predictors in a 
Statistical adjustment technique to reduce the systematic errors of a dynamical track 
prediction model. Meanor (1987) used Wilson’s wind component fields to generate 
EOF fields of vertical wind shear. Using Schott’s stratification scheme, Meanor 
demonstrated that the coefficients from the EOF analysis of wind shear also could be 
used as synoptic forcing predictors in a Statistical adjustment technique to reduce 


systematic errors of a dynamical track prediction model. 


B. OBJECTIVES AND GOALS 

The primary objective of this study is to use the existing “conventional” data 
base; EOF coefficients of wind fields (Wilson, 1984) and wind-shear fields (Meanor, 
1987); and selected intensity information to generate useful 24, 48 and 72 h intensity 


prediction equations for tropical cyclones in the western North Pacific region. 
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Admittedly, microwave satellite data, cirrus streamer information, sea-surface 
temperature data, aircraft reconnaissance data and landfall data could also provide 
meaningful information in intensity forecasting. The goal of this study is to take the 
first step in developing improved 24, 48 and 72 h intensity prediction schemes for 
tropical cyclones in the western North Pacific. The eventual goal is to provide an 
expert system or decision-tree approach, similar to that investigated by Peak and 
Elsberry (1987) for tropical storm motion, that could be used by the Joint Typhoon 
Warning Center (JTWC) for operational intensity forecasting. 
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II. DATA CASE SELECTION 


A. DATA DESCRIPTION 

The cases in this study are a subset of the cases Wilson (1984) and Meanor 
(1987) used. These 12-hourly data are for tropical storms in the western North Pacific 
region for the period from 1979 to 1983. The following restrictions apply to the 
selection of these cases: 


¢ Tropical storms must be located in the Eastern Hemisphere, east of 100° E with 
a Warning position less than 34.6° N; 


e Storm intensity must be at least 18 m/s (35 kt); and 


e Zonal and meridional wind components must be available at 700, 400 and 250 
mb levels. 


A total of 1357 cases meet these requirements. 


1. Original Cases (Wilson/Meanor) 
a. Conventional Data 
The conventional data include observation date/time, storm number and 
Warning positions (current; forecast 24, 48 and 72h). Additional warning-based 
information is available as zonal speed, meridional speed and horizontal displacement 
for three periods: (1) from 12h prior to observation time until observation time, 
(2) from 24 h prior to observation time until observation time, and (3) from 24 h prior 
to observation time until 12 h prior to observation time. Best track positions (current; 
past 12 and 24 h; and future 24, 48 and 72 h) are also available. 
b. Empirical Orthogonal Function Coefficients of Wind and Vertical Wind Shear 
The data set for each 12h case also includes the empirical orthogonal 
function coefficients of the zonal and meridional wind fields at three levels (Wilson, 
1984) and the zonal and meridional shear fields across three layers (Meanor, 1987). 
The wind information used by Wilson (1984) and Meanor (1987) is from the Global 
Band Analysis (GBA) operationally generated by the U.S. Navy at the Fleet 
Numerical Oceanography Center (FNOC). The GBA fields are plotted on a Mercator 
grid girding the globe from 41° S to 59.8° N, with a grid spacing of 2.5° lat by 2.5° long 
at 22.5° N and S. The zonal and meridional fields are available from 00 GMT and 
12 GMT at the surface, 700. 400, 250 and 200 mb. Surface analyses are based on land 


observations and ship reports, while upper-air analyses are based on rawinsonde 
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observations, aircraft reports and satellite-derived cloud motion vectors. Temperature 
analysis at the intermediate levels are used to couple the winds at adjacent vertical 
levels via the thermal wind relationship. The 12h old analvsis plus 5% climatology is 
used as the first-guess field for the current analysis. If no observations are available in 
a region, the final analysis becomes the previous analysis adjusted towards climatology. 

Wilson (1984) defined a relocatable, geographicallv-oriented grid of 
527 points with a fixed zonal and meridional separation of 277.8 km (150 n mi). There 
are 31 points west to east and 17 points north to south. Thus, the domain is 8334 km 
(4500 n mi) by 4445 km (2400 n mi). The grid center (row 9, column 16) is coincident 
with the tropical cyclone center in each case. Wilson used a bi-linear interpolation 
scheme to extract the zonal and meridional component winds at 700, 400 and 250 mb 
from the GBA. 

Lorenz (1956) first applied empirical orthogonal function analysis to 
geophysical fields. It has been used regularly to efficiently describe the variability in 
atmospheric fields. With EOF representations, a large percentage of the variance in a 
data field can be described by the summation of relatively few orthogonal eigenvectors 
and their associated coefficients (eigenvalues). This results in a significant reduction in 
the computer storage space needed to describe synoptic fields, which are ordinarily 
defined by numerous grid point values. 

Wilson generated EOF representations of the zonal and meridional wind 
fields at three levels (700, 400 and 250 mb) and applied a Monte Carlo approach to 
select those small sets of rank-ordered eigenvectors and their associated coefficients 
that describe the signal in the original fields. For this study, the first 35 coefficients of 
the zonal and the meridional wind fields at each level are available for the 1357 cases. 
Wilson showed no less than 90% of the variance in all of the zonal wind fields and 
82% of the variance in all the meridional wind fields to be explained by the first 35 
eigenmodes. The first 25 coefficients of the zonal and meridional wind fields are used 
as potential predictors in this study. 

Following Wilson’s methods, Meanor (1987) generated the EOF 
representations for the zonal and meridional wind-shear fields across three lavers: 
upper (250-400 mb), lower (400-700 mb) and deep (250-700 mb). Meanor also applied 
a Monte Carlo approach to select those small sets of rank-ordered eigenvectors with 
their associated coefficients that describe the variance in the wind-shear fields. The 


first 35 coefficients for the zonal and meridional shear fields across the three lavers are 
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available for the 1357 cases. Meanor showed no less than 80% of the variance in the 
zonal wind-shear fields and 79% of the variance in the meridional wind-shear fields to 
be explained by the first 25 and 35 eigenmodes, respectively. The first 25 coefficients of 
the zonal and meridional wind-shear fields are used as potential predictors in this 
study. 
2. Combined-data cases 

In this study intensity data are added to the data set used by Wilson and 
Meanor. These intensity data are extracted from the Annual Tropical Cyclone Reports 
for 1979 through 1983 published by the Joint Typhoon Warning Center (JTWC) 
and include: 


e Best track intensity (current, past 12h, past 24h, and subsequent 24, 48 and 
To My: 


e Warning intensity (current and past 12 h); and 

® JTWC official forecast intensity (24, 48 and 72 h). 
From these values, best track (past 12h; and future 24, 48 and 72h) and JTWC 
forecast (24, 48 and 72 h) intensity change data are computed. There are 1216 cases in 


the combined-data cases (Wilson/Meanor data plus intensity data) for use in this study. 


B. LAND-OCEAN SORTING 

Only storms over the ocean and within the region bounded by the equator, 
100° E, 34.6° N and 180° are used in this study. The combined data set is subjected to 
the following simple land-sea sorting process to separate cases for storms positioned 
over ocean from cases for storms affected by land. 

The bounded region is subdivided into one degree latitude by one degree 
longitude grid squares, as in Fig. 2.1. If the current or 12h old position is within a 
land square or outside the bounded region, the associated 24, 48 and 72 h intensity 
data are eliminated from the sample. If the current and 12h old positions are within 
ocean squares, the positions of the storm at the subsequent forecast times (24, 48 and 
72h) are evaluated. If the position at any of these times is within a land square or 
outside the bounded region, the intensitv change data at that time and all subsequent 
times are considered unrepresentative and eliminated from the sample. 

An example is illustrated in Figure 2.1 based on Typhoon Nelson from March 
1982. The current (24/00 GMT), 12 hour old (23/12 GMT) and t+ 24 h (25/00 GMT) 
positions of the storm are within ocean grid squares. Because the t+48 h position 1s 


located within a land square, the t+48h and t+72h intensity change data are 
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removed from the sample that is used to derive the regression equations and verify th 
intensity forecasts. 
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III. REGRESSION APPROACH 


The approach in this study is to use regression analysis techniques to investigate 
the predictive skill of EOF coefficients of wind and vertical wind shear in forecasting 
24, 48 and 72 h changes in tropical storm intensity. The UCLA Biomedical Computer 
Program (Dixon and Brown, 1985), entitled BMDP2R, is used to select the predictors 
and to develop the regression model. Tables 1, 2 and 3 are lists of the potential 


predictors considered for use in the regression equations. 


A. POTENTIAL PREDICTORS 

The potential conventional predictors are listed in Table 1. The first three 
predictors (1-3) are the current Julian date and the JTWC warning position (latitude 
and longitude). The next nine predictors (4-12) describe the storm translation during 
the past 24h in terms of the zonal velocity, the meridional velocity and the total 
displacement. Additional predictors (13-22) in this group include warning intensity 
data (current, 12h old and past 12h change); best track intensity data (current, 12 h 
old and past 12 h change); and best track position data (current and 12 h old, latitude 
and longitude). 

The second set of potential predictors (23-172), which are listed in Table 2, are 
the wind-based EOF coefficients generated by Wilson (1984). These represent the 
external forcing on the cyclone by the environmental winds at three levels (700, 400 
mide sO ioe lhe format used to identify these predictors is CLWNN; where 
C indicates a wind-based coefficient, L indicates the level (2 for upper, 250 mb; 4 for 
middle, 400 mb; and 7 for lower, 700 mb), W indicates the zonal or the meridional 
component wind field (U for meridional, V for zonal wind), and NN is a coefficient 
number from | to 25. 

The third set of potential predictors (173-383), which are listed in Table 5, are the 
wind-shear EOF coefficients generated by Meanor (1987). These represent synoptic 
forcing upon the storm by vertical differences in the environmental wind through three 
layers. The format used to identify these potential predictors is SLLWNN; where 
S indicates a wind-shear coefficient, LL indicates the layer (47 for the lower layer, 
which is 400 minus 700 mb; 24 for the upper layer, which is 250 minus 400 mb; and 
27 for the deep layer, which is 250 minus 700 mb) and NN is a coefficient number from 
tOi2 9, 
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B. REGRESSION ANALYSIS 

To predict changes in tropical storm intensity over 24, 48 and 72h, a stepwise 
regression analysis is used. The BMDP2R program computes estimates of the 
parameter through a multiple linear regression in a stepwise manner by entering or 
removing variables one at a time from a list of potential predictors. At each step in the 
BMDP2R regression analysis routine, the predictor that has the highest partial 
correlation with the predictand (given the previous selection of predictors) is selected 
from the remaining set. Consequently, the predictand is the result of a sum of 
uncorrelated independent variables (Dixon and Brown, 1985). 

The F-to-enter value is a function of the number of variables available for 
selection, their correlation structure and the sample size. In this study, the selection 
continues until the new predictor does not meet a minimum F-to-enter value of 4.0. 

The coefficient of multiple determination (R) 1s a measure of the relationship 
between the independent and the dependent variables in the regression model and 
represents the amount of total variance in the predictand that is explained by the 


independent variables, 

R* = SSR / SSTO = 1 -(SSE / Somoe (3.1) 
Where SSR is the regression sum of the squares, SSTO is the total sum of the squares 
and SSE is the residual sum of the squares. To further restrict the number of 


predictors in the equations, only those predictors that increase R* by at least 0.01 are 


retained. Finally, an arbitrary limit of ten predictors is set. 
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Potential conventional predictors 


Name 
DAYJUL 


LAT 
LON 
VXOO1Z2 
VYOOT2 
VOO12 
VXA0024 
VY0024 
VO0O024 
VA12Z24 
VY1224 
V1i224 
WIOO 
WIM12 
DWIM10 
BIOO 
Bah 
DBIM10O 
BLAT 
BLON 


BLTM12 
BLMM12 


available for the regression analysis. 


Description 


Julian date 

Warning position { latitude) 

Warning position ( longitude) 

Zonal storm speed from -12 h 
to 00 h (km/h) 

Meridional storm speed from 
= Z ae On OO ( km) 

Total storm movement from 
-12 h to 00 h (km) 

Zonal storm speed from -24 h 
to 00 h (km/h) 

Meridional storm speed from 
-24 h to 00 h (km/h) 

Total storm movement from 
“24 h to 00 h (km) 

Zonal storm speed from -24 h 
Eome2e hie km/h) 

Meridional storm speed from 
“24 h to -12 h (km/h) 

Total storm movement from 
-24 h to -<12 h (Km) 

Warning OO h intensity 

Warning 12 h old intensity 


Warning -<12 h to OO h change in 


intensity 
Best track 00 h intensity 
Best track 12 h old intensity 


Best track -<12 h to 00 h change in 


intensity 
Best track 


Best track 00 h position ( longitude) 
Best track -12 h position ( latitude) 
Best track -12 h position (longitude) 


iy 


00 h position (latitude) 


LABS 


Potential wind EOF coefficient predictors 
available for the regression analysis. 


Number 
23-47 


SAS A 


[3-20 


98-122 


123-147 


148-172 


Name 
C2U1-25 


CZ Zs 


C4U1-25 


C4V1le25 


C7U1<25 


CTV £225 


Description 


250 mb wind coefficients 
for zonal modes 1 = 25 
250 mb wind coefficients 
for meridional modes 1 
400 mb wind coefficients 
for zonal modes 1 - 25 
400 mb wind coefficients 
for meridional modes l 
700 mb wind coefficients 
for zonal modes 1 - 25 
700 mb wind coefficients 
for meridional modes 1 
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derived 


derived 
- 25 
derived 


derived 
- 25 
derived 


derived 
- 25 





TASES 3 


Potential wind-shear EOF coefficient predictors 
available for the regression analysis. 


Number - 
173-197 


198-222 


223-247 


248-272 


ZS] 


298-322 


Name 
S$47U1=-25 


547V1-25 


924U1=-25 


S2Z4V1=-25 


9Z27U1-25 


$Z27V1=25 


Description 


400 minus 
derived 
400 minus 
derived 
250 minus 
derived 
250 minus 
derived 
250 ominous 
derived 
250 minus 
derived 


2] 


FOO 
eis 
700 
Lor 
400 
IB(ONe 
400 
fr Oe 
700 
meds 
ZOO 
ors 


mb shear coefficients 
zonal modes 1 = 25 

mb shear coefficients 
meridional modes 1 = 25 
mb shear coefficients 
zonal modes 1 - 25 

mb shear coefficients 
meridional modes 1 = 25 
mob shear coefficients 
zonal modes 1 = 25 

mb shear coefficients 
meridional modes 1 = 25 





IV. STUDY METHODS AND VERIFICATION OF RESULTS 


A. BASIC METHODOLOGY 
The purpose of this study is to investigate the usefulness of empirical orthogonal 
function coefficients as predictors in an objective forecast scheme of the 24, 48 and 
72 h western North Pacific tropical storm intensity. The basic four-part approach is 
illustrated in Fig. 4.1 and discussed in the following four subsections. 
1. Select Data Cases 
This study involves the application of regression analysis techniques (Chapter 
III) to various groupings of the 1216 data cases in the combined-data set (Chapter 11). 
Several groupings of the data are investigated in this study: 
¢ A-complete dependent data set (all 1216 cases); 
° Dependent-case/Independent-case subsets; and 
¢ Subsets stratified by previous 12 h intensity. 
The application of the basic study approach to these data groupings is addressed in 
Section B of this chapter. 
2. Screen Potential Predictors 
Because the number of cases in any of the data groupings is small relative to 
the number of the potential predictors, the potential predictors are screened to 
determine which are dominant. The predictors are divided into three categories: 
¢ CONV Category - The conventional data listed in Table 1; 


¢e WIND Category - The first 25 EOF coefficients of the zonal and menidional 
wind fields at three levels (700, 400 and 250 mb) listed in Table 2; and 


¢e SHEAR Category - The first 25 EOF coefficients of the zonal and meridional 
vertical wind-shear fields across three lavers (400-700, 250-400, and 250-700 mb) 
listed in Table 3. 


For each predictand (24, 48 or 72 h intensity change) in each data set/subset, a series 
of three 10-step regression analyses is performed based on each of the three categories 
of predictors. 
3. Generate Regression Equations 
The predictors selected during the screening procedure {a maximum of 30 
predictors: up to 10 from each of the three regression analyses) are consolidated. A 
final 10-step regression is performed using these screened predictors to generate the 


final equation for the predictand in question. 
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4. Verify Regression Equations 
The regression-derived equations for intensity changes are used to compute 


forecast intensities at time tt: 
Ritt = WI00 + Ditt, (4.1) 


where WI00 is the warning intensity at observation time, DItt is the regression-derived 
change in intensity over the forecast interval, and RItt is the regression-derived forecast 
intensity at verification time tt. The performance of the final regression equations 1s 
verified relative to the performance of: 

e The JTWC official forecast; and 

e A persistence forecast. 
The means and standard deviations of the absolute value of the intensity error in the 
regression, the JTWC and the persistence forecasts are computed and compared. A 
Student-T test is applied to determine which schemes provide significant improvement 


at the 95% confidence level. 


B. APPLICATION OF THE METHODOLOGY 
1. Complete dependent data set 

The approach outlined above is first applied to the complete (dependent) data 
set, 1.e., all 1216 cases. Various combinations of best track and/or warning predictors 
are considered for use as the conventional predictors (CONV). The following 
combination of eight predictors is chosen: date, best track position (current and 12h 
old; lat and long), best track intensity (current and 12 h old) and best track past 12h 
change in intensity. This combination explains the greatest variance in the intensity 
change at the three forecast periods and it has the smallest number of missing values. 

The predictors selected during the screening process for each of the forecast 
periods are listed in Tables 4, 5 and 6 for the CONV, WIND and SHEAR category 
predictors, respectively. The number of potential screened (SCRN) predictors available 
for each final regression equation is reduced to a maximum of 28 for each 
predictand: eight conventional predictors, ten wind and ten wind-shear EOF 
coefficients. 

Only three of the eight potential conventional predictors are selected for any 


one of the three equations (Table 4). The number of predictors selected 1s limited by 
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TABLE 4 


CONV predictors selected after screening regression 
on 24, 48 and 72 h best track intensity change (kt) 
with the complete dependent data set (1216 cases). 
The numbers indicate the order in which predictors 
are selected for each equation. The coefficients 
of multiple determination (R**2) are shown 
for each equation. 


Forecast Interval 
Predictor 24 48h 72h 


DAYJUL S 
BLAT 

BLTM12 

BLNM12 
BaM 1 Z 

DBIM10 


Fee, 





the minimum F-to-enter and change in R* requirements applied to the subsequent 
predictors. The 12 h old intensity, rather than the current intensitv, is the first 
predictor selected at all three forecast periods. At 24 and 48 h, the past 12 h change in 
intensity is the second predictor selected. This combination of predictors corresponds 
to a two predictor equation for the extrapolation of the intensity trend. There is no 
consensus on the additional conventional predictors that are chosen for the three 
forecast intervals. 

Of 150 potential wind EOF coefficient predictors, seven are selected for the 
24 h equation and ten are selected for the 48 and the 72 h equations (Table 5). The 
coefficient of the first eigenmode of the zonal wind at 400 mb (C4U 1) 1s the first 
predictor selected in all three equations, while the coefficient of the second eigenmode 
of the zonal wind at 400 mb (C4U 2) is selected second in the equations at 24 and 72 h 
(and fourth at 48h). Wilson (1984) suggests the patterns of modes | and 2, which 
account for the largest variance in the zonal and meridional wind fields, can be 


interpreted separately as representing particular atmospheric flow patterns. He states 
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TABLE 5 


WIND predictors selected after screening regression 
on 24, 48 and 72 h best track intensity change (kt) 
with the complete dependent data set (1216 cases). 
The numbers indicate the order in which predictors 
are selected for each equation. The coefficients 
of multiple determination (R**2Z2) are shown 
for each equation. 


Forecast Interval 


Predictor 24h 48h 72 1 
C2U14 7 
C2V 25 6 
C4U 1 ale 1 1 
C4U 2 Zz 4 2 
C4U16 9 10 
C4U20 7 
C4V12 6 4 
Ci 7 3 6 
Gel sey 8 8 
C7U24 9 
C7 yes 3 Z q 
Cyn Vd 5 
ey 13 5 
C7V14 5 3 
C7V16 le 
R**2 GO. 26 0.32 O.356 


that the complexity of the eigenvalues makes it difficult to associate higher order 
modes for any of the fields with any observable atmospheric patterns. If the 400 mb 
conditions can be assumed to represent the mean flow through the depth of the 
troposphere, the first coefficient of the zonal wind 1s indicative of the mean Zonal 
environmental flow. A positive value (related to easterly flow) generally may be 
associated with storm development, while a negative value (related to westerly flow) 


would imply recurvature and associated weakening. 
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TABLE 6 


SHEAR predictors selected after screening regression 
on 24, 48 and 72 h best track intensity change (kt) 
with the complete dependent data set (1216 cases). 
The numbers indicate the order in which predictors 
are selected for each equation. The coefficients of 
of multiple determination (R**2) are shown 
for each equation. 


Forecast Interval 
Predictor 24h 48 h a 
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S47U 
S47U 
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S47V15 


szZz4U 1 
S24U 2 


SZ4VZ0 6 


52 /¥ oi ZZ 6 
S2Z27V 4 S 

52) 

OZ / Van > 
oz / Vee 

22 ao 

li 122 


FeZ QO. 16 CmL6 On 22 


= 
4 


CO Wb WN - 


fe 


is 


W O © 


Of the 150 potential wind-shear coefficients, six are selected for the 24 and 
48 h equations and nine for the 72 h equation (Table 6). In contrast to the selected 
conventional and wind EOF coefficient predictors, the wind-shear EOF coefficients are 
less consistent in time. None of the wind-shear coefficients are selected for all three 
equations (24, 48 and 72 h). Only four wind-shear predictors (S47U 1, $47U 2, S24U 1 
and $27V 1) are selected for two of the three equations. 
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Notice that the explained variance increases with increasing forecast interval 
for all three categories of potential predictors. The conventional predictor equations 
account for the most explained variance, while the wind-shear EOF predictor equations 
account for the least explained variance. Of the nine equations, the conventional 
predictor equation for the 72 h forecast intensity explains the most variance. 

Before combining the screened predictors and doing a final regression using 
these selected screened predictors, the performance of the equations derived from the 
three separate categories of predictors is investigated (Table 7). Analysis of Table 7 
shows that the mean intensity forecast error and the standard deviation of the intensity 
forecast error increase as the forecast interval increases for all schemes (JTWC; CONV, 
WIND and SHEAR predictor). For all forecast intervals, the equations generated 
using only the best track conventional predictors perform better (have smaller average 
absolute errors) and are more consistent (have smaller standard deviations in the 
average absolute error) than the equations generated using only wind EOF coefficient 
predictors, which perform better and are more consistent than the equations generated 
using only wind-shear EOF coefficient predictors. Although the official JTWC 
intensity forecast errors are smaller than all the regression-derived equations at 24h, 
the best track conventional predictor equations perform better and are more consistent 
than JTWC at 48 and 72h. Recall that these results are for a dependent sample. 
Presumably, even more accurate predictions are possible if all three categories of 
screened predictors are included. 

The three screened-predictor regression-derived equations for the 24, 48 and 
72h intensity change and the coefficient of multiple determination (R*) for each 
predictor are indicated in Table 8. For all forecast intervals, the regression process 
terminates before ten predictors are selected, because the F-to-enter values or the 
amount of variance explained by the subsequent predictors are too small for further 
stepping. Only 4, 6 and 5 predictors are selected at 24, 48 and 72h, respectively. As 
suggested by Table 4, the 12 h old best track intensity is the first predictor chosen for 
all three forecast intervals (24, 48 and 72h). This observation prompted a later 
Stratification of the data (to be discussed in Section B.3 below) based upon 12h old 
best track intensity. Several wind EOF coefficient predictors appear in the screened 
predictor equations. In fact, C4U 1 and C4U 2 are among the top four predictors in 
all three equations. No EOF coefficients of wind shear are chosen. As Meanor (1987) 


suggests, perhaps this is due to the close relationships between the wind and wind- 


TABLE 7 


Verification of JTWC and regression-derived 
(CONV-, WIND- and SHEAR-predictor) forecasts 
of 24, 48 and 72 h tropical storm intensity (kt) 
for the complete dependent data set (1216 cases) 
based on land-filtered and homogeneous samples. 


JTWC Forecast Intensity 


Avg Abs Sica 

Cases BE Eeor Dev 

24h 886 eorenl eas 
48 h Son: Ba 3 WIS 1 
a2 462 24.5 LSoe@ 


Best Track Conventional Predictors (CONV) 


Avg Abs Std 

Cases Eee or Dev 

24h 886 Loa 1S 
48 h 64 20.9 0 
G2 Nn 462 IIMS eZ 


Wind EOF Coefficient Predictors (WIND) 


Avg Abs Sea 

Cases ies ©yts Dev 

24h 886 14.6 Zoe 
48 h eS 1 Bae lier. 
ae al 462 Zone oS 


Shear EOF Coefficient Predictors (SHEAR) 


Avg Abs > Ea 

Cases ia aons Dev 

24h 886 15.4 12.8 
48h Gul 24.5 Lory 
jan 462 AeeeZ ZO «6 


shear synoptic forcings. After a wind EOF coefficient predictor is selected, the wind- 


shear EOF coefficients that are highly correlated with it will not be selected. 
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TABLE 8 


Regression equations for the change in intensity 
(kt) at 24, 48 and 72 h using the complete depen- 
dent data set (1216 cases). Parenthetical values 
indicate the order in which the screened predictors 
are selected for each equation. The coefficients 
of multiple determination (R**2) are shown. 


Forecast Interval 


24h 48h i2e 
Y-Intercept 15. 64 -0.97 48.71 
Predictor 
BLNM12 ~ O.28 (6) o 
BIM12 -2.24 (1) =. 57 -(1) -0.78 (1) 
DEB Iie 0.46 (2) ~ ~ 
C2U14 ~ =1..95 () = 
C4U 1 0.54 (3) - Lwos @. 1. 32a 2)) 
C4U 2 0738) 4) 0.71 (3) 1.11 (4) 
C4V12 ~ = 1.31 (SS 
C7Un7 = -1.36 (4) “1-71 (33 
Cases 886 684 5a 
R**2 0.39 Os i 0.56 


Interestingly, the regression-derived equations explain a larger percentage of 
variance in the predictand with increasing forecast interval. This 1s a favorable result 
because the objective is to provide forecast guidance at 48 and 72 h. However, notice 
that the maximum value of explained vanance (at 72 h) 1s only 56%; 1.e., 44% 1s still 
unexplained. 

The performance of the SCRN predictor equations relative to the performance 
of homogeneous samples of persistence, JTWC and CONV predictor forecasts is 
illustrated in Table 9. At all forecast hours, the smallest mean absolute errors are 
associated with the regression-derived intensity forecasts generated using the SCRN 
predictor equations. In addition, the standard deviations of mean absolute errors 
associated with these equations are the smallest, which indicates more consistent 


forecasts. 


TABLE 9 


Verification of persistence, JTWC and regression- 

derived (CONV- and SCRN-predictor) forecasts for 

24, 48 and 72 h tropical storm intensity (kt) for 

the complete dependent data set (1216 cases) based 
on land-filtered and homogeneous samples. 


Persistence Forecast 


Avg Abs SEC 

Cases Id guelaye Dev 

24h 886 ez eS. 7 
48h Siow Zion 19.8 
aa 462 Seo, Zon ® 

JTWC Forecast 

Avg Abs SiC 

Cases Dgapecoue Dev 

24h 886 130m iS. 
48 h 651 ZS eS 
eZ, 24.5 Lo. 


h 462 


Regression-Derived Forecast 


Best Track Conventional Predictors (CONV) 


Avg Abs SEC 

Cases Eas © Dev 

24h 886 Leno tel 

48h 65 20.9 Leo 

1 aae 462 Zn Ld waz 
Regression-Derived Forecast 

Selected Screened Predictors (SCRN) 

Avg Abs Ste 

Cases Br eor Dev 

24h 886 1SeuO ke 2 

48h et i 5 14.8 

Oe 462 Posie dep. 


Notice that the JTWC forecast performs better than persistence, particularlv 
at 72 h. Student-T significance tests indicate that the JTWC forecast is better than 


Ol 


persistence (95% confidence level) at all forecast hours. Because the intensity 
observations and forecasts are rounded to the nearest 5 kt value at each end of the 
change interval, discretization errors result. Therefore, no scheme is likely to perform 
with a minimum error of less than 10 kt. Although the 13 kt mean absolute error of 
the JTWC official forecast at 24 h is relatively good, this error approximately doubles 
by 72 h. 

The results of the CONV predictor equations (the basis for existing intensity 
forecast schemes) are repeated from Table 7 for comparison with the SCRN predicter 
equations. Notice that the additional contribution of synoptic predictors (wind and 
wind-shear EOF coefficients) in reducing the mean absolute error is small (0.5 kt at 
24h, 1.4 kt at 48 h and 1.3 kt at 72h). Comparison of Table 7 with Table 9 suggests 
that much of the variance explained by the WIND (or SHEAR) predictors 1s already 
contained in the selected CONV predictors. Nevertheless, the synoptic forcing 
represented by the EOF coefficients does lead to significant intensity forecast 
improvements at 48 and 72 h in this dependent data sample. 

2. Dependent-case/Independent-case subsets 

The above results based on the dependent sample may be overly optimistic, 
because the verification cases were used to derive the regression equations. Thus, the 
data cases were subdivided into dependent-case and independent-case subsets: 


e To investigate the effect reducing the sample size would have on the regression- 
derived equations for 24, 48 and 72 h intensity change, and 


e To investigate the predictive skill of the dependent-case regression-derived 
equations when applied to an independent-case data subset. 


The independent-case subset of 405 cases is constructed by selecting every third case in 
the complete data sample. The dependent-case subset is the remaining $11 cases. 

The basic approach in Fig. 4.1 (Section A above) is applied to the dependent 
sample. The resulting equations are listed in Table 10 for the 24, 48 and 72 h intensity 
change. For each forecast interval, the predictors selected first and explaining the 
largest percentage of the variance in the predictands in Table 10 are common to the 
equations derived using the complete dependent set (Table 8). In the 24, 48 and 72h 
regression equations, the selection sequence is common between the two data sets for 
the first 3,4 and3 predictors, respectively. More predictors are selected for the 
dependent-case subset equations than the complete dependent set (6, 7 and 6 versus the 
4,6 and 5 at 24, 48 and 72h). The dependent-case subset equations explain slightly 
more variance (0.42, 0.52 and 0.59 compared to 0.39, 0.51 and 0.56) than the complete 


dependent set equations. This is expected, since the sample sizes are smaller; i.e., they 
contain less of the natural variability of the ensemble of possible cases. However, 


adding more predictors may not lead to better predictions in an independent test. 


TABLE 10 


Regression equations for the change in intensity 
(kt) at 24, 48 and 72 h using the dependent-case 
data subset. Parenthetical values indicate the 
order in which the SCRN predictors were selected. 
Asterisks indicate common predictors with the 
corresponding equations for the complete dependent 
set in Table 8. The coefficients of multiple 
determination (R**2) are shown. 


Forecast Interval 


24 h 48 h ve hl 
Y-Intercept Boag Soo 1 46.44 
Predictor 
BLAT -0.40 (6) = = 
BIM12 =m (i1)* =O, Sot Ly) =Ome77 «( 1)* 
DBIM10 0.40 (2)% = = 
C4U 1 Or39 = )* 1.45 (2)* ios eet. 2 )* 
C4U 2 = C7 Zales) * ©. ee (5) 
C4U20 ee 7 Wl) = = 
C4V12 = ie Osa | 7 ) 1.34 (6) 
C4V15 1.04 (4) = = 
Gaia) _ “1.32 (4)%* “1.98 (3)*% 
$27U24 = ieee.) = 
sory 1 = 10) YAS (ey) -0.98 (4) 
Cases 588 457 346 
R**2 QO. 42 Om sz 0. 59 


The verification of the SCRN predictor equations derived from the smaller 
dependent-case subset (applied to both the dependent-case and independent-case 
subsets) is summarized relative to homogeneous samples of persistence and JI WC 


forecasts in Table 11. For ease of comparison, the verification results of the SCRN 
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TABLE 11 


Verification of persistence, JIWC and 
regression-derived (SCRN) 24, 48 and 72 h 
forecasts of western North Pacific tropical storm 
intensity (kt) using land-filter and homogeneous 
data from the complete dependent data set (CDS), 
the dependent-case subset (DCS), and the 
independent=<case subset. 


Complete Dependent Data Set = CDS SCRN Predictor Eqns 


Persistence JTWC Regression 

Av Abs Std Av Abs Std Av Abs Std 

Cases EPRos Dev Fagis@ 1: Dev Basiaous Dev 

24h 886 eZ 1S nash Seat: dee 1Sr30 11 
48 h 651 Zee emo Zl emo Lees 14.8 
72 h 462 3337 23.9 24.5 1939 ake 16e0 


Dependent-Case Subset - DCS SCRN Predictor Eqns 


Persistence JTWC Regression 

Av Abs’ Std Av Abs Std Av Abs Std 
Cases Bao Dev ldpgiaena Dev Erner Dev 

24 h 587 7 ge ee 350 A aes 12.4 102 
48 h 439 Zoe. 1933 Zee 3 16.9 18.9 14.6 
1 Ze Slee 33), 0 24.5 24. 3 Le: 20.6 16m 


Independent-Case Subset = DCS SCRN Predictor Eqns 


Persistence JTWC Regression 
Av Abs Std Av Abs Std Av Abs Std 
Cases Beis ols Dev Errew Dev EiPraoen Dev 
24 h 299 Le ae 14.3 Loree 11. 4 14.5 11.8 
48 h 212 23.2 ZA? elle: L529 2 Oma 14.9 


72h etee So el 22> Zors Se we. 1. 16mm 


predictor equations derived from the complete dependent set are also repeated from 


Table 9. The similar characteristics (mean absolute errors and standard deviations) 


between the homogeneous samples of persistence and JTWC forecasts associated with 
the complete dependent set and the dependent-case subset implies the dependent-case 
subset is a representative sample of the complete dependent set. As expected, 
regression equations derived from the smaller dependent-case subset perform better 
than the equations derived from the complete dependent sample (average absolute 
errors of 12.4,18.9 and 20.6 kt versus 13.0, 19.5 and 21.3 kt for 24, 48 and 72h, 
respectively). Thus fictitious improvement is attributed to either a dependent-case 
sample size that is too small for proper development of the regression equations, or F- 
to-enter and R? criteria that are too lenient for properly restricting the predictor 
Selection. However, when the dependent-case equations are applied to the 
independent-case subset, the good performance suggested by the dependent-case results 
is not sustained. Nevertheless the performance is better (smaller average absolute 
errors) and 1s more consistent (smaller standard deviations) than JTWC official 
forecasts at 48 and 72h. For example, the mean absolute errors in this independent 
sample are 20.2 and 22.1 kt versus 21.2 and 24.8 kt for JTWC. 
3. Subsets stratified by previous 12 h intensity 

Recall that the 12 h old best track intensity is the first predictor chosen in the 
SCRWN predictor intensity-change equations for the complete dependent set at all three 
forecast periods (Table 4). Therefore, the 1216 data cases are subdivided into terciles 
based upon the 12 h old best track intensity. This 1s a common practice in that 
conventional-predictor forecast schemes currently in use at the operational forecast 
centers are stratified by intensity. The frequency distribution of 12 h old best track 
intensity values and the tercile cut-points are illustrated in Fig. 4.2. The stratification 
scheme used to subdivide the data cases into weak, moderate and strong subsets based 
on 12 h old intensity is illustrated in Table 12. An exact division into three equal-size 
categories 1s not possible because intensity values are recorded to the nearest 5 kt. 

The basic approach in Fig. 4.1 (Section A above) is applied to each of the 
three tercile subsets. The results of the screening process on each category of potential 
predictors (CONV, WIND and SHEAR) selected for each tercile subset and each 
forecast interval are indicated in Tables 13, 14 and 15. The values in the nine columns 
on each table indicate the order in which the screened predictors were selected as the 
next dominant predictor for the associated data subset and forecast interval. As with 
the smaller dependent-case sample (Section B.2 above), more predictors generally are 


selected for each equation when the data are stratified into terciles (smaller sample 
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Figure 4.2. Tlistogram of 12 h old best track intensity (kt) 
with tercile cut-points between weak-moderate and moderate-strong storms. 


TABLE 12 


stratification scheme for the tercile subsets with 
data stratified according to ey See 1Z a 


best track intensity 


Class Cases Intensity 


Weak 399 (eee > CG 
Moderate 7 45 kt < I < 70 kt 
SL EIEO VSS | a VO.kc < | 





sizes) than are selected using the complete dependent set. This may be misleading, as it 


was with the smaller dependent-case subset, when independent cases are examined. 











TABLE 13 
















Conventional predictors (CONV) selected after 
screening regression on 24, 48 and 72 h best track 
intensity change (Kt). Stratified data sets are 
based on 12 h old best track intensity (Kt). The 
numbers indicate the order in which the predictors 

were selected. 


WEAK MODERATE STRONG 
PREDICTORS 24h 48h 72h 24h 48h 72h 24h 48h 72h 
DAYJUL Z i es Z | 
BLAT 2 4 3 g 5 
DBIM10 at at a il i Z 
BLTM12 Z Zz S 5 “ 5 
BLNM12 hae. 5 1 2 a Z 
E00 Z i 


The screening on CONV predictors illustrates several points (Table 13). 
Selection of past 12h intensity change as either the first or second 24 and 48h 
conventional predictor for all three tercile subsets implies extrapolation of the intensity 
trend is useful as a technique for the shorter range intensity forecast, but not the 72 h 
forecast. The current intensity is the first (48 and 72h) or second (24 h) predictor 
selected for all forecast periods using the strong tercile. This suggests persistence is a 
useful parameter in the forecast of stronger storms. ‘ 

The results of screening with WIND predictors using the tercile subsets (Table 
14) and the complete dependent set (Table 5) may be compared. A total of 57 WIND 
predictors are selected for the nine equations using tercile subsets, as opposed to only 
15 predictors selected for the three complete dependent set equations. Six of the 
WIND predictors (C2U14, C2V25, C4U 2, C4L16, C7V1I3 and C7V14) selected using 
the complete dependent set are not selected using the intensity-stratified subsets. This 
observation 1s surprising because C4U 2 was the second predictor selected in the 24 
and 72 h equations (fourth in the 48 h equation) for the WIND predictor screening 
using the complete dependent set (lables3)™ Purthermore, C4U 2 entered allies 
SCRN predictor equations using the complete dependent set (Table $8). 

The results of the screening with SHEAR predictors using the tercile subsets 
(Table 15) may be compared with the complete dependent set (Table 6). A total of 58 
SHEAR predictors are selected in the nine equations for tercile subsets as opposed to 
17 SHEAR predictors selected in the three equations for the complete dependent ser 
Five of the SHEAR predictors (S47U 4, S24U 2, $27V 4, §27V19 and $27V22) selected 
using the complete dependent set are not selected using the intensity-stratified subsets. 

The SCRN predictor equations for the 24, 48 and 72 h intensity are illustrated 
in Tables 16, 17 and 18, which correspond to the weak, moderate and strong subsets, 
respectively. Analvsis of the equations selected for the weak tercile (Table 16) indicates 
that lower layer wind-shear EOF coefficients are selected first (S47U11) in the 72h 
forecast equation and second (S47U 1) in the 24 h forecast equation. Selection of past 
12 h change in intensity as a predictor in the 24 and 48h (but not 72 h) equation 
suggests the usefulness of extrapolation in the short-term forecast with weak tercile 


Storms. 


TABLE 14 


Wind EOF coefficient predictors (WIND) selected 
after screening regression on 24, 48 and 72 h best 
track intensity change (kt). Stratified data sets 

are based on 12 h old best track intensity (kt). 

The numbers indicate the order in which the 
predictors were selected. 


WEAK MODERATE STRONG 
24n 48h 72h 24h 48h 72h 24n 48h 72h 
PREDICTORS 
ou 1 i u 
20 6 8 
C2U 8 a 
C2U 9 LO 
CZU13 8 
GZ01 7 Z 
C219 4 i > 
C2U20 Z 
eZ 1. > 4 > a 
e218 y 
C225 ee 3 
C2V24 9g 
C4U 1 il il a al Z 
C4U 5 8 
C4U10 2 
C4U14 6 
C4U20 3 + 
C4U22 ¢ 
C4U25 6 
C4V 2 6 
C4v 7 3 Zz 
C4V 8 10 
C4Vil 9 
C4Viz il 
C4V15 4, Z 
C4V16 LO > 
C4V19 S 3 8 
C4V20 8 
C4V23 6 
C4V24 3 
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C0 
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O74, 
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eo 
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TABEE 15 


Vertical wind-shear EOF coefficient predictors 
(SHEAR) selected after screening regression on 
24, 48 and 72 h best track intensity change (kt). 
Stratified data sets are based on 12 h old best 
track intensity (kt). The numbers indicate the 
order in which the predictors were selected. 


WEAK MODERATE STRONG 

PREDICTORS 24h 48h 72h 24h 48h 72h 24h 48h 72h 

547U 1 i Zz a S 5 

S47U 2 Z Zz 

S47U 3 y 8 

S47U 8 2 

547U10 9 

S47U11 6 iL 

S47U12 

547U14 Zz 

547U15 LO 

5S47U16 8 4 Z 

547U17 | 8 fa 0 

S47U22 2 

547U24 ae 2 

S47V 3 S S 

547V 6 ¢ 

947V 7 IE, 

547V12 2 

947V15 9 

S47V16 So 

547V18 8 

547V19 7 

S47V21 fi 

547V2Z5 8 

SZ24U 1 Hi 1 i 

SZ4U 3 Z 

sZ24U 4 5 

S24U17 8 

S24U20 LOmmeLO 

S2Z24UZ21 S 

SZ24U2Z2 i 
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TABLES 
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S$24U23 5 
$24U24 9 
S24V 2 

S24V 3 

S24V 4 

S24V 8 7 

S24V 9 

S$24V14 

S24V15 S 

S24V18 6 
S24V20 

SZ4v24 9 8 

S$24V25 4, 

SZaoe t 5 

S27U 4 

SZ 7 

S27U 9 
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S$27U18 6 
S27 025 5 
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TABLE 16 


Regression equations for the change in intensity 
(kt) at 24, 48 and 72 h using data stratified by 
12 h old best track intensity (WEAK tercile). 
Values in parentheses indicate the order in which 
SCRN predictors were selected. 
of multiple determination (R**2) are shown. 


Y-Intercept 

Prealccor 
DAYJUL 
BB EM1O 
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C4uU 1 
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TABLE May, 


Regression equations for the change in’ intensity 
(kt) at 24, 48 and 72 h using data stratified by 
12 h old best track intensity (MODERATE tercile). 
Values in parentheses indicate the order in which 
SCRN predictors were selected. 

of multiple determination (R**2Z2) are shown. 
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Alay li) wakes: 


Regression equations for the change in intensity 
(kt) at 24, 48 and 72 h using data stratified by 


12 h old best track intensity (STRONG tercile). 


Values in parentheses indicate the order in which 
SCRN predictors were selected. 
of multiple determination (R**2) are shown. 
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SHEAR predictors are not among the first six predictors chosen for any of the 
moderate or strong tercile equations (Tables 17 and 18). This might be physically 
relevant in that a weak storm will not develop with large environmental shear, but if 
the storm develops to more than 45 kt (moderate or strong tercile case), vertical wind 
shear 1s not a significant factor in deciding further intensity changes. 

Notice that the explained variance values are largest at 48h (0.51), and 
especially at 72 h (0.61), in the strong tercile equations. By contrast, the explained 
variance is lowest (0.36) at 24 h in the strong category. 

The verifications of the equations for the three terciled subsets are illustrated 
in Table 19. The verification of the complete dependent data set is repeated from 
Table 9 for comparison. The average absolute error for the 24 h forecast is smallest 
(9.3 kt) for the weak tercile and largest (13.2 kt) with the strong tercile equations. The 
average absolute errors for the 48 h equations are comparable for all three terciles 
(16.4, 17.6 and 15.4 kt). The average absolute error for the 72 h equation 1s smallest 
for the strong tercile equation (14.7 kt) and largest for the weak tercile (18.7 kt). 

The weighted-average absolute error of the regression-derived intensity 1s 


computed for each forecast period as 


_ (NyAAEY + NpAAE,, + N,AAE,) 
nani N ; 
-“total 


AAE (4.2) 


where AAE indicates the average absolute error and N 1s the sample size. The 
subscripts refer to the particular data set; 1.e., the complete dependent set (total) or a 
subset of the complete set statified by 12 old best track intensity (‘w’ indicates the 


weak, ‘m’ indicates the moderate, and ‘s’ indicates the strong tercile). 
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TABLE 19 


Verification of 24, 48 and 72 h 
tropical storm intensity forecasts (kt) 
for WEAK, MODERATE and STRONG terciles 
(stratified by 12 h old intensity) 
using SCRN predictor equations. 
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The verifications of the intensity-stratified equations are outlined in Table 20. 
Verifications of the homogeneous persistence, JTWC and complete dependent set 
forecasts are repeated from Table 9 for comparison. The weighted-average absolute 
errors indicate that the intensity-stratified equations based on dependent cases perform 
better than all other forecast schemes over all forecast intervals. Student-T tests 
confirm that the tercile subsets are significantly better (95% confidence level) than the 
JTWC official forecast at 48 and 72 h. 
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TABLE 20 


Verification of 24, 48 and 72 h tropical storm 
intensity forecasts (Kt): (1) persistence, 
(2) JTWC, (3) regression (complete set) and 
(4) regression (stratified subsets, based on 

12 old best track intensity). 


Persistence Forecast 


Avg Abs Std 

Cases Ee Yada ie Dev 

24h 886 eae ste Sie 7, 
48 h evo Ish, Mt oo 
LNG 462 Bo ee) 23.9 


JTWC Forecast 


Avg Abs 5 ed 

Cases |i ah ako) g Dev 

24h 886 Loe i eo 
48h 650 Zs ito. oO 
laa 462 24.5 tay’ O 


Regression-Derived Forecast 
screened predictors, Unstatified Data 


Avg Abs Std 

Cases Bere r Dev 

24h 886 iS ea0 ie. 2 

48 h Sali 19 14.8 

va hh 462 2s dees: 
Regression-Derived Forecast 

Screened Predictors, Stratified Data 

wr Avg Sta 

Cases Abs Error Dev 

24h 886 eae --- 

48 h Gisal 16e5 -<-- 

72h 462 yee --- 
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V. SUMMARY AND RECOMMENDATIONS 


This study is the first step in the development of an enhanced objective technique 
for predicting 24, 48 and 72h intensity of tropical cyclones in the western North 
Pacific region. The eventual goal is to develop an effective aid -for the Joint Typhoon 
Warning Center (JTWC) to forecast tropical storm intensity, particularly at 48 and 
(Pe 

The EOF coefficients of zonal and meridional components of the environmental 
wind at 250, 400 and 700 mb (Wilson, 1984) and wind shear from 250 to 400, from 400 
to 700, and from 250 to 700 mb (Meanor, 1987) are considered as potential predictors. 
Additional predictors include conventional storm-related parameters, such as date, 
intensity, motion and position. The 1216 cases in this study are 12 h data for western 
North Pacific tropical cyclones from 1979 to 1983. The basic methodology involves 
the following four steps: 


e Select a data set, 1.e., complete dependent set; independent-case/dependent-case 
subsets; or subsets stratified by 12 h old intensity; 


e Screen predictors using stepwise regression analysis to select the dominant 
predictors; 


e Generate regression equations using stepwise regression analysis and the 
screened predictors to generate regression equations; and 


e Verify the equations relative to the performance of the Joint Typhoon Warning 
Center official forecast. 


When the basic methodology is applied to a complete set (1216 cases), the 
regression equations using only conventional predictors are slightly improved by 
inclusion of synoptic forcing fields represented by the EOF coefficients. Furthermore, 
the regression equations perform slightly better than the JTWC official forecasts. 

When the equations generated using a smaller dependent-case subset (811 cases) 
are applied to the dependent-case subset, similar results are observed. Relative to a 
homogeneous sample of JTWC official forecasts, the regression equations developed 
using the dependent-case subset show progressively improved performance with 
increasing forecast interval. Despite a slight increase in the average absolute error at 
all forecast intervals when the dependent-case equations are applied to the 
independent-case subset, the performance of these equations 1s still comparable to a 


homogeneous sample of JTWC official forecasts. 
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When the basic method 1s applied to subsets stratified by 12 h old intensity, the 
regression equations perform better than the JTWC official forecast at all forecast 
intervals. These equations are significantly better (95% confidence) than the JTWC 
official forecast at 48 and 72 h. 

These results suggest that the official JTWC tropical storm intensity forecasts can 
be enhanced by application of statistical regression analysis techniques. The 
performance of the existing techniques based on conventional storm-related predictors 
can be progressively improved by using: 


¢ regression equations based on selected screened predictors drawn from EOF 
coefficient predictors of wind and vertical wind shear; and 


¢ regression equations developed from tercile subsets (the cases are statified by 
12 h old intensity) and selected screened predictors. 


The preliminary success of the screened regression equations, particularly those 
developed using the stratified case subsets, suggests EOF coefficients of the wind and 
the vertical wind-shear fields should be computed routinely using current data. With 
the EOF coefficient predictors routinely available, these objective techniques could be 
tested in an operational environment using independent cases. To further enhance 
these objective techniques, the predictive ability of other synoptic or remotely sensed 


parameters should be investigated. 
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